Learning to Fight

Authors

  • Thore Graepel
  • Ralf Herbrich
  • Julian Gold
Abstract

We apply reinforcement learning to the problem of finding good policies for a fighting agent in a commercial computer game. The learning agent is trained using the SARSA algorithm for on-policy learning of an action-value function represented by linear and neural network function approximators. We discuss the selection and construction of features, actions, and rewards as well as other design choices necessary to integrate the learning process into the game. The learning agent is trained against the built-in AI of the game with different rewards encouraging aggressive or defensive behaviour. We show that the learning agent finds interesting (and partly near optimal) policies in accordance with the reward functions provided. We also discuss the particular challenges arising in the application of reinforcement learning to the domain of computer games.
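The abstract names SARSA with a linear action-value function but does not give the details. The following is a minimal sketch of that general technique, not the authors' implementation. It assumes a hypothetical toy task (a five-cell corridor with a goal at the right end), one-hot state-action features, epsilon-greedy exploration, and illustrative values for the step size `alpha`, discount `gamma`, and `epsilon`. None of these choices come from the paper, whose features, actions, and rewards are game-specific.

```python
import random

N_STATES = 5          # hypothetical corridor of 5 cells; the last cell is the goal
ACTIONS = (-1, +1)    # assumed toy actions: move left or move right

def features(state, action):
    """One-hot feature vector over (state, action) pairs (an assumed encoding)."""
    phi = [0.0] * (N_STATES * len(ACTIONS))
    phi[state * len(ACTIONS) + ACTIONS.index(action)] = 1.0
    return phi

def q_value(weights, state, action):
    """Linear action-value estimate: Q(s, a) = w . phi(s, a)."""
    return sum(w * f for w, f in zip(weights, features(state, action)))

def epsilon_greedy(weights, state, epsilon, rng):
    """Explore with probability epsilon, otherwise act greedily (random tie-break)."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    values = [q_value(weights, state, a) for a in ACTIONS]
    best = max(values)
    return rng.choice([a for a, v in zip(ACTIONS, values) if v == best])

def step(state, action):
    """Toy dynamics: reward 1 on reaching the goal cell, 0 otherwise."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done

def train_sarsa(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    weights = [0.0] * (N_STATES * len(ACTIONS))
    for _ in range(episodes):
        state = 0
        action = epsilon_greedy(weights, state, epsilon, rng)
        done = False
        while not done:
            next_state, reward, done = step(state, action)
            # On-policy: the next action is drawn from the same policy being learned.
            next_action = epsilon_greedy(weights, next_state, epsilon, rng)
            target = reward
            if not done:
                target += gamma * q_value(weights, next_state, next_action)
            td_error = target - q_value(weights, state, action)
            # Gradient of a linear Q with respect to the weights is phi(s, a).
            phi = features(state, action)
            weights = [w + alpha * td_error * f for w, f in zip(weights, phi)]
            state, action = next_state, next_action
    return weights
```

Calling `train_sarsa()` returns the learned weight vector, and `q_value(weights, state, action)` can then be compared across actions to read off a greedy policy. Changing the reward in `step` is the place where, in this toy setting, one would encode a preference for different behaviour.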

Similar resources

The Fight against Spam - A Machine Learning Approach

The paper presents a brief survey of the fight between spammers and antispam software developers, and also describes new approaches to spam filtering. In the first two sections we present a survey of the currently existing spam types. Some well-mapped spammer tricks are also described, although the imagination of spam distributors is endless, and therefore only the most common tricks are covere...

Gray’s Reinforcement Sensitivity Theory: a psychometric critique

This study identifies valid orthogonal scales of Gray’s animal learning paradigms, upon which his Reinforcement Sensitivity Theory (RST) is based, by determining a revised structure to the Gray–Wilson Personality Questionnaire (GWPQ) (Wilson, Gray, & Barrett, 1990). It is also determined how well Gray’s RST scales predict the surface scales of personality, which were measured in terms of Eysenc...

How May E-Learning Groups Interact?

How may e-learning groups interact? According to Wheelan (1994), for physical groups there are several types of interaction: dependency, flight, fight, counter-dependency, pairing, counter-pairing and work. Our study adopts the content analysis approach to investigate how virtual groups from a cyber-university interact over a period of twenty weeks. Our findings suggest that those e-learning gr...

Hypertension Prediction in Primary School Students Using an Ensemble Machine Learning Method

Introduction: The prevalence of hypertension in children is increasing, and this complication is considered the most important risk factor for cardiovascular diseases in older age. Early detection and control of hypertension can prevent its progress and reduce its consequences. Machine learning methods can help predict this complication promptly and reduce cost and time. This study aimed to pro...

Malware and Machine Learning

Malware analysts use Machine Learning to aid in the fight against the unstemmed tide of new malware encountered on a daily, even hourly, basis. The marriage of these two fields (malware and machine learning) is a match made in heaven: malware contains inherent patterns and similarities due to code and code pattern reuse by malware authors; machine learning operates by discovering inherent patter...

Publication date: 2004